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The subroutine package TREEPAK is designed to interface a PL/ I 
program with a graphic display unit, allowing display, creation and 
manipulation of tree structures, while providing the user with a 
graphic representation of the tree. The package is implemented under 
IBM System/360 using the IBM 2250-1 graphic display unit as the pri- 
mary input/output device. Facilities include creation of trees from 
the alphanumeric keyboard and cards, adding and deleting subtrees in 
existing trees, and saving and retrieving trees. Trees may be passed 
between TREEPAK and the user written main program. An implementation 
of the Wang Algorithm is given as an example application program. 
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I . INTRODUCTION 



A. STATEMENT OF THE PROBLEM 

The objective of this thesis was to develop an interactive sub- 
routine package which interfaces a PL/I program with a graphic dis- 
play unit, allowing display, creation and alteration of tree structures 
in graphic form. There is a large class of problems in which a hier- 
archical relationship exists among the data, and a tree type data 
structure is often an efficient and effective means of representing 
this relationship. This package, called TREEPAK, is designed to pro- 
vide the capability of on-line input and manipulation of data in tree 
structures with the additional facility of allowing the user to view 
a graphic representation of the tree. TREEPAK is quite general and 
may be accessed from any PL/ I program. Figure 1 illustrates the gen- 
eral configuration of a system using TREEPAK. 

Various terminology has been used to characterize trees and the 
nodes of a tree. The terminology used throughout this thesis is de- 
scribed in order to clarify the discussion of TREEPAK. 
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B. TERMINOLOGY 



A tree has been formally defined j_ 1_/ as a finite set T of one or 
more elements, called nodes , such that (a) there is a distinguished node 
R called the root , and (b) the remaining nodes (excluding the root) are 
partitioned into m^O subsets T^,T2 .*♦ T m where each of these subsets 
is in turn a tree. Although no relationship between nodes is expressed 
by this definition, it is assumed that the root of any particular tree 
is logically related to the remaining nodes. 

The trees T^,T 2 • T m are called the subtrees of R. This defini- 
tion is represented graphically in Figure 2, which shows a tree with ten 
nodes. The root A has three subtrees whose roots are B, C and D. B, C 
and D have one, zero and two subtrees respectively. 




Figure 2. Graphic Representation of a Tree. 
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The roots of the subtrees of a node are called the successors of 



that node. A node is the father or predecessor of its successors. The 
number of subtrees of a node is its degree . Thus the degree of A is 
three, the degree of B is one, and C has degree zero. A node which has 
no successors is called a terminal node . 

The level of a node with respect to the root is defined by the fol- 
lowing: the root is at level one, and the level of every other node is 
one plus the level of its father. Thus B, C and D are at level two; E, 

F and G are at level three. Although these definitions apply to graphic 
trees as shown in Figure 2, they may be applied to tree structures used 
in list-processing as well. 

A familiarity with list-processing fundamentals is assumed, however, 
a presentation of these concepts is found in references 1 and 2. It 
should be noted that the essential difference between a tree and the more 
general list is that a tree does not contain cycles, i.e., there is a 
unique path from the root to any node in the tree. However, the "sub- 
lists" of lists need not be disjoint. A list may even be a "sublist" of 
itself. This difference is important in application, but in a graphic 
display any list may be represented as a tree if some node is designated 
as the root. Further discussion of this point is found in Section II,C. 

As used in list-processing, the internal representation of a tree 
node consists of some number of memory cells which contain the informa- 
tion associated with the node and a means for locating its successors 
or its predecessors, or both. The content, format, and representation 
of the information is entirely dependent on the application. The nodes 
may contain no information at all, e.g. , the non-terminal nodes of a 
LISP tree. In a hierarchical data retrieval system each node may contain 
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large amounts of coded data in various formats* The information asso- 
ciated with the node will hereafter be referred to as the name of the 
node, since the information may be thought of as an interpretation of 
the name * 

A means for locating the successors or predecessor of a node is 
required for traversing the tree, i 6 e,, visiting the nodes in some de- 
sired order. The following recursive algorithm is commonly used for 
traversing trees: (a) visit the root, then (b) visit each of the suc- 

cessors of the root. In an application which uses this algorithm for 

V 

traversing trees there must be a means for locating the successors of 
a node Frequently this is provided by pointers contained within the 
node. In this context a pointer is simply a number which represents 
the address in memory of a successor, or "points to" the successor* 
Other representations of trees are found in references 1 and 2* The 
most suitable representation depends primarily on the order in which 
the nodes are visited, 

t 

The algorithm given in the preceding paragraph for traversing a 
tree reflects the recursive nature of trees* This recursive nature 
comes from similarity of subs true lures 0 As stated in the definition 
of a tree, each node is the root of a tree, This property allows 
each node to have the same format in memory regardless of its degree. 

In the next section two specific applications which use tree- 
type data structures are described, 

C . BACKGROUND 

Tree-type information structures arise in many computer appli- 
cations* Of particular interest in the development of this thesis 
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were those applications which use tree structures in interactive environ- 
ments. The Transformational Grarmner Tester (TGT) at Systems Development 
Corporation 3_/ is one such application* Transformational grammers are 
built on the concept of the logical separation of two types of grammati- 
cal structure, the deep and surface structures. Accordingly, there are 
two systems of rules associated with the syntactic component of the gram- 
mer. Phrase structure rules generate deep structure taking the form of 
labeled trees, and transformational rules map trees to other trees de- 
termining the ultimate surface structure of a sentence. As the grammer 
becomes large and the linguist attempts to account for more phenomena in 
the language he is describing, it becomes more difficult to provide for 
all the interrelationships of the rules. TGT was designed to assist the 
linguist in analyzing, refining, and extending the grammer initially 
specified. 

The most important tasks performed by TGT center around its ability 
to execute transformations. Using TGT, the linguist can determine the 
applicability of transformations, execute them, and display the results. 
Facilities are available for creating, displaying, and manipulating 
trees which represent the structure of a sentence much like the facil- 
ities available in TREEPAK. However, the graphic display unit is only 
an auxiliary output device in the TGT system. Input is through a tele- 
typewriter terminal, thus, all commands and operands must be typed, as 
opposed to quicker and more natural means available when the graphic 
display unit is the primary input /output device. 

J. H. Bennett and others describe a series of computer programs 
developed at Air Force Cambridge Laboratories J_ 4_/. These programs, 
called SAM, are experimental tools for studying techniques in theorem 
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proving using man-machine interaction. Given a finite set of logical 
formulas, SAM attempts to generate "interesting" consequences by using 
four processes called reduction, expansion, digression, and contradic- 
tion. Reduction uses a set of formulas from the original set to reduce 
or simplify a given formula using logical rules of the Predicate Cal- 
culus. Expansion and digression use these same rules to generate new 
formulas from the initial set of formulas. Contradiction attempts to 
eliminate "trivial" formulas by finding a contradiction of the negation 
of a formula. SAM applies these four processes in a pattern which al- 
lows the newly generated formulas to stay in the set only if they can- 
not be reduced by reduction or eliminated by contradiction. 

Using the graphic display unit as the primary input /output device, 
the user initiates action by setting up an initial list of formulas. 

SAM then starts to generate consequences of these formulas. As SAM 
works on the lists of expansions and reductions, the user is able to 
watch these lists on the display. Each new formula generated appears 
on the display and each formula eliminated disappears as SAM updates 
the current list of formulas. The user may intervene at any point in 
the process. If S.\M is running out of useful things to do with the 
formulas first given, the user may insert additional formulas. He may 
guide the process by deleting formulas which seem to be unimportant or 
distracting. 

Some of the initial formulas may be marked by asterisks to indi- 
cate that they are the negations of formulas whose proofs are being 
sought. In this case, all the consequences of the original starred 
formulas are starred. It is hoped that SAM can derive a contradic- 
tion. If so, SAM ha ; demonstrated by contradiction that the disjunc- 
tion of the unnegated versions of the original starred formulas is a 
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logical consequence of the other initial formulas (axioms and theorems). 
The authors report that one result of their experimentation has been the 
actual solution by man-machine interaction of an open problem in Lattice 
Theory. 

The significance of systems of this type seems to be that by using 
man-machine interaction, results may be obtained which neither could ob- 
tain alone. The two areas mentioned, language analysis and theorem 
proving, are still in the early stages of their development and complete 
algorithmic processes are not yet known. It appears to be particularly 
advantageous for applications of these types to have the capability of 
human intervention and guidance when necessary, thus taking advantage 
of the speed and logical power of the computer, as well as the insight 
and experience of the human. When the application also uses a hierar- 
chical data structure, it seems that the ability to view and interact 
with the structural relationships in graphic form would enrich the in- 
teractive process. 

In the next section a description of TREEPAK is given. The fol- 
lowing section describes an example application program which allows 
the user to interact with an implementation of the Wang Algorithm 
using TREEPAK. 
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II. DESCRIPTION OF TREEPAK 



A # GENERAL OVERVIEW 

TREEPAK, which consists of approximately nine hundred PL/ I state- 
ments, is designed to provide any PL/I program the capability of dis- 
playing and manipulating tree structures . PL/ I was chosen as the 
implementation language for two reasons: (a) general list processing 

features are available in PL/I, and (b) recursive subroutine calls are 
allowed. As previously stated, algorithms for traversing trees are 
inherently recursive in nature and the availability of a language cap- 
able of recursion is a significant asset in programming. 

The IBM 2250-1 Graphic Display Unit is the terminal input/output 
device for TREEPAK. This unit is equipped with an alphanumeric key- 
board, light pen, and programmed function keyboard (PFK) . All three 
of these facilities are utilized to perform various functions. Maxi- 
mum use is made of the light pen and PFK since their use is more 
natural than using the alphanumeric keyboard. The alphanumeric key- 
board is used only when it is necessary to type a name or a list. 

The interface between TREEPAK and System 360 graphics routines 
is provided by IBM Contributed Program 360°00. 6.009 (WJS999) /_ 5 _/. 
This subroutine package provides access to the graphic input /output 
routines and Graphics Problem Oriented Routines included in references 
6 and 7. The package also allows interrupts from the console to be 
handled with PL/ I ON CONDITIONS. 
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B. ENTRY AND EXIT CONDITIONS 



TREEPAK is designed to run as a subroutine to a user-written in? in 
program. Two arguments are required in the call to TREEPAK: a PL/ I 

pointer variable and a character string. If the value of the pointer 
variable is not the NULL pointer it is assumed to point to the root of 
a tree which exists in the main program at the time of the call. The 
tree and the character string are both displayed on the screen as a re- 
sult of the call. After TREEPAK is entered all of its capabilities are 
available to the user, including creating and displaying new trees, 
saving the tree currently on display, and altering the tree on display. 
A detailed account of the capabilities is given in the next section. 
When the user returns control to the main program a pointer to the root 
of the original tree is returned. 

The required format of the nodes of the tree is shown in Figure 3. 
This format is a direct result of the following PL/I structure 
declaration : 



DCL BOUND FIXED BINARY, 



1 NODE BASED (P) , 

2 NAME CHAR (10), 

2 DEGREE FIXED BINARY, 

2 BRANCH (0: BOUND REFER (DEGREE)); 




Node Name 



Node Degree- 





required 



Figure 3. Node Format 
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The first ten bytes contain the name of the node. These ten bytes must 
be EBCDIC coded alphanumeric characters. The next four bytes contain a 
fixed binary integer which is the degree of the node. The node degree 
is followed by four-byte pointers to the successors of the node. These 
pointers contain the absolute internal address of each successor. The 
nodes are declared as PL/ 1 based structures £ 8 and 9 _/. 

The NULL pointer may be used as an argument in the call to TREEPAK. 
In this case no tree will be displayed until one is created by the user 
Hence, there is no requirement that a tree exist in the main program at 
the time of the call. If TREEPAK is entered with the NULL pointer, a 
pointer to the tree on display is returned to the main program. This 
allows the user to create a new tree while in the subroutine, manipu- 
late the tree, and pass it back to the main program for analysis. 

C. CAPABILITIES 

The capabilities for operations on trees include creating new trees 
adding a structure to an existing tree, and deleting all or part of a 
tree. Operations are performed on the "current 11 tree, i.e., the tree 
currently on display. The following sections contain a general de- 
scription of each of these operations. Detailed instructions on how 
to perform each operation are given in Appendix A, Before continuing 
a discussion of the capabilities, a description of the display screen 
is given. 

The tree being displayed is centered on the screen, with equal 
vertical distance between each level. All nodes on the same level 
appear on a straight line equidistant across the screen. This spac- 
ing takes maximum advantage of the display area of the screen since 
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the tree appears as large as possible for that size tree. Each node is 
numbered, with node names corresponding to each number listed along the 
sides of the tree. The list notation described in the next section is 
shown across the top of the screen. Messages to the user, if applicable, 
appear on the bottom line of the screen® See Appendix C for photos of 
typical displays. 

If the current tree has many levels, or many nodes at some level, 
it may not be possible to display all of the tree at the same time. To 
do so would make the screen completely unreadable. Thus, a maximum of 
ten levels and twenty nodes at each level is displayed. In this case 
some nodes have successors which do not appear on the screen. These 
nodes are marked with a "-f" on the display. The substructures of these 
nodes may be displayed separately (see Appendix A, PFK key 4). 

1 . Creation of Trees 

Trees may be created either by reading them from cards in the 
input stream or they may be typed at the console using the alphanumeric 
keyboard. The tree in Figure 2 could be created by typing the follow- 
ing string of symbols: 

(A (B (E (H, I,J)),C,D(F S> G))), 

The rules for this notation are: (a) successors of a node are paren- 

thesized immediately following that node, (b) nodes on the same level 
are separated by commas, and (c) the entire list is enclosed in paren- 
theses. This notation is completely general and may be used to unambig- 
uously describe any tree in which all nodes are named. Only one node in 
a tree appears at level one. TREEPAK only creates one tree at a time, 
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thus care must be taken to ensure that the expression contains exactly 
one node at level one. If the expression represents more than one tree, 
only the first tree in the list will be created. 

2c Altering the Tree Structure 

The PFK and light pen are used for manipulating the tree on 
display. Operations are performed on the M identif ied" node. To iden- 
tify a node for subsequent alteration, the light pen is activated on 
the node. Upon sensing that a node has been touched with the light pen, 
TREEPAK places a visible pointer beside the node on the screen to sig- 
nal to the user that the proper node has been identified (see Figure 4, 
node number 3). If a node has not been identified in this manner, an 
attempt to alter the structure will result in no action. After a node 
is identified, two basic operations may be used to alter the structure 
-- add and delete. 

The add function performs the operation of adding a node or a 
structure as a new successor to the identified node. When the user 
issues this command he is then given the option of adding the new 
structure in any position he wishes. Figure 4 shows node number 3 as 
the identified node after issuing the add command. The position of 
the new successor is chosen by placing the light pen on the asterisk 
which is in the desired location. The new successor may be created 
from the alphanumeric keyboard, or it may be a tree previously saved 
in the save area described in paragraph 3. 
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Figure 4* Adding a Successor to the Identified Node 

The identified node and its substructure may be deleted from 
the tree. The detached portion of the tree may be named and saved for 
further use, if desired, or it may be deleted from the system* The 
next paragraph contains a discussion of saving trees in the save area 
and retrieving them. 

3. Naming and Saving Trees 

The tree on display, or any part of it, may be saved for later 
reference. If a subtree of the current tree is saved, it is not neces- 
sary to detach that subtree, although it is possible to do so as dis- 
cussed in the previous paragraph. 

When placing a tree in the save area, the user may assign it 
a name* If a name is given which already exists in the save area, 
TREEPAK informs the user and replaces the old tree. Each tree saved 
is also assigned a number. The user is informed of this number and 
later references may be by name, if one was assigned, or by the as- 
signed number. Saved trees may be recalled at any time, either to 
become the new current tree, or for addition to a specified node. 
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A list of the names and numbers of trees in the save area can 



be displayed. If desired, trees may be dropped from the save area. See 
paragraph II. D. 5 for further information about the save area. 

D. IMPLEMENTATION 

Some of the techniques used in implementing TREEPAK are discussed 
in this section. The details of the coding are omitted here (the pro- 
gram listing is found in Appendix B) . 

1 . Coordinates for the Display 

To display a tree, the position of each node on the screen must 
be computed. The screen can be thought of as a Cartesian coordinate 
grid with the lower left corner of the screen assigned coordinate (0,0) 
and the upper right assigned coordinate (4095,4095). To compute the 
coordinates for each node in the display, the tree to be displayed must 
be traversed twice. On the first traversal, the number of levels and 
the number of nodes at each level to be displayed are counted. These 
numbers determine the distance between nodes on the screen. On the 
second traversal, the coordinates for each node are computed, using 
the information obtained on the first traversal and the node f s posi- 
tion in the tree. The following formulas are used for computing the 
coordinates of each node on the screen. H and V are the horizontal 
and vertical distance, respectively, between nodes. X and Y are the 
horizontal and vertical coordinates. 

H = 4096/ (No. of nodes at that level + 1) 

V = 4096/ (No. of levels 4* 1) 

X = H(No. of nodes already visited at that level + 1) 

Y = 4095 - V (Level of the node) 
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These computations are performed by TREEPAK each time a tree is 
displayed or altered. The number of levels and the number of nodes at 



each level are saved for use in locating the identified node as described 
below. From these numbers the actual node position and the vertical and 
horizontal distances between nodes can be computed at any time. 

2. Cycles in the Tree 

From the definition of a tree given in Section I«B, a tree does 
not contain cycles, i.e. , there is a unique path from the root of the 
tree to each node. In practice it is often useful for list-processing 
trees to contain cycles. The structure may still be pictured as a tree 
by repeating all overlapped nodes until none are left. Alternatively, 
the first node in a cycle may be treated as a terminal node on the sec- 
ond and subsequent visits. The former method produces an infinite tree 
if the tree is recursive. Thus, the latter method is the only practical 
way of implementing cycles in a graphic display. 

To determine when a cycle is encountered, each node must be 
marked as the tree is being traversed. Each time a node is visited, 
a test is made to determine whether the node has been marked. If so, 
then a cycle has been found. TREEPAK marks the nodes by changing the 
sign of the node degree* When a node is found which has a negative 
degree, this node is treated as a terminal node for the display, ex- 
cept that a is placed beside the node on the screen and the user 
is informed that the tree contains cycles. Changing the sign of the 
degree introduces the additional requirement of resetting the degrees 
to their original value after each traversal. 
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3* Finding the Identified Node 



When the light pen is activated on a node, the coordinates of 
the location at which the light pen is detected are used to determine 
which node is the identified node. The level of the node can be de- 
termined using the distance from the top of the screen to the point 
where the light pen is detected. Referring also to V from paragraph 1, 
Level = (distance from top of screen to detect) /V. 

The number of nodes on the left of the identified node at the same 
level is found using the distance from the left of the screen to the 
light pen detect, along with H from paragraph 1 as follows: 

No. on left = ((distance from left of screen)/H) - 1. 

These numbers uniquely describe a node in the tree. The tree is then 
searched until the node in that position is found, and a pointer to 
that node is saved. 

4. Altering the Structure 

Because of the internal representation used for the nodes, the 
amount of storage required to contain a node depends on the degree of 
the node. Altering the tree structure involves adding or deleting a 
successor, which changes the degree of the affected node. Thus, when 
the structure is changed, the node whose degree is changed must be re- 
placed with a new node of the appropriate size. Figures 5 and 6 show 
how TREEPAK handles additions and deletions. The number alongside 
each node represents an arbitrary memory address for the node. The 
organization of the nodes is as shown in Figure 3. 
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Figure 5(a) shows a tree of five nodes* Figure 5(b) shows the 
tree with node E deleted. Note that in Figure 5(a) the cell shown at 
memory address 140 is in the free storage area* This cell is obtained 
from free storage using the PL/ I ALLOCATE statement* The proper infor- 
mation from the old node B is placed into it. The pointer to B in node 
A is changed to point to the new cell. The old node B at address 100 is 
released to free storage using the PL/I FREE statement. 

Figure 6(a) shows the same tree as Figure 5(a). Figure 6(b) 
shows the tree after the addition of node F as a new successor to node 
D. In this example j the cell at address 190 is obtained to hold the 
new node. The cell at address 150 is released to free storage after 
the replacement has been made and the appropriate pointer in node A 
changed. 

5 * The Save Are a 

When trees are saved , only two items of data are recorded in 
the save area -- a pointer to the root of the tree and the name of the 
tree. The save area consists of the following PL/I structure 
declaration: 

DCL TOP FIXED BIN, 

1 SAVE BASED (P ) 9 
2 NUM FIXED BIN, 

2 ROOTS (0 : TOP REFER (NUM)), 

3 PNTR POINTER, 

3 NAME CHAR(10); 

NUM is the number of trees which are currently saved, determining the 
upper bound for the dimension of ROOTS* If no name is assigned to a 
tree its NAME is blank. The number assigned to the tree by TREEPAK 
corresponds to the subscript of ROOTS which contains the pointer to 
the tree. 
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(a) Before deleting node E 




(b) After deleting node E 



Figure 5. The Delete Operation 
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Figure 6. The Add Operation 
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There is no practical limit on the number of trees which can be 
saved. Since SAVE is declared as a BASED structure using the REFER^ 
option, it will be allocated as large as necessary to save the desired 
trees * 

Refer to the program listing in Appendix B for explicit details 
on the implementation of the various functions of TREEPAK. Appendix A 
contains instructions on the interface requirements of the main program 
and user operation of the graphic display console. A sample application 
is described in the next section. 



The REFER option is used in the IBM implementation of PL/I to 
specify that the value of a variable outside a based structure is to 
be used to determine the length or bound of a variable within the 
structure upon allocation. In this example the value of TOP at the 
time of the allocation determines the upper bound for the dimension 
of ROOTS. 
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III. APPLICATION PROGRAM 



A. DESCRIPTION OF THE WANG ALGORITHM 

This section gives an example of a program which uses tree struc- 
tures for representing strings of symbols in theorem proving. The 
algorithm is described in this section; a description of the implemen- 
tation is given in the next section* The program listing is found in 
Appendix D. 

The Wang Algorithm _/ 9_/ is a mechanical method of determining 
whether or not a formula in the Propositional Calculus is a theorem. 
The language of the Propositional Calculus is defined by the Backus 
Naur Form statements in Table 1. 

A sequent may be thought of as a statement which is true if and 
only if some formula in the string on the left side of the arrow (the 
antecedent) is false or some formula in the string on the right side 
of the arrow (the consequent) is true. That is, the conjunction of 
the formulas on the left implies the disjunction of the formulas on 
the right. The five connectives, NOT, AND, OR, EQUIV, IMPLIES, are 
given their usual logical interpretation. 

< SEQUENT) ::= <STRING> — -(STRING) 

<STRING> : := < EMPTY) ! <STR ING SET> 

<STRING SET) : := (FORMULA) | (STRING SET> , (FORMULA) 

<EMPTY> : := {The null string of symbolsj . 

(FORMULA) ::= (PROPOSITIONAL LETTER) | NOT( < FORMULA) )| 

( (formula) "(binary connective) (formula) ) 

(BINARY CONNECTIVE) ::= AND | OR | IMPLIES | EQUIV 

(PROPOSITIONAL LETTER) {An alphabetic letter 3 

Table 1. Definition of the Language 



28 



In general, the algorithm proceeds as follows. The formula to be 
tested is first preceded with the arrow to form a sequent in which the 
antecedent is the null string and the consequent is the formula to be 
tested. The consequent is then reordered into a form of Polish nota- 
tion. The connectives are eliminated from the sequent, beginning with 
the leftmost connective, according to ten rules of elimination. This 
process produces a finite set of sequents which contain strings of pro- 
positional letters on both sides of the arrow. The truth of each se- 
quent produced is then tested according to the following initial rule: 

1. If a,b are strings of propositional letters, then 
a — ►b is a theorem if and only if some propositional 
letter occurs on both sides of the arrow. 

Ten rules of derivation are listed below. In the rules, a and b 
are always strings (possibly empty) of propositional letters; c, d, e, 
and f are strings. It will be noted that there are two rules for each 
logical connective, one introducing it on each side of the arrow. Using 
these rules, the usual proof procedure would begin with a finite number 



2A. — ► NOT: 

2B, NOT : 

3A. — ► AND: 

3B. AND — ► : 
4A. — ► OR: 

4B. OR — ► : 

5A. -^-IMPLIES: 
5B. IMPLIES^ 

6A. —►EQUIV: 

6B. EQUIV— ► : 



If c,a-^b,d then a-ar-b ,NOT (c) ,d . 

If a,c— • fr*~d,e then a ,NOT(e) , c— *»-d . 

If a—fe^-b,c,d and a~>-b,e,d then 
a— ►bjCc AND e),d. 

If a,c,d,e— i*-f then a, (c AND d),e— te^f. 
If a— ►bjCjdje then a— ►b,(c OR d),e. 

If a,c,d— ^e and a,f,d— i*-e then 
a,(c OR f),d— ►e. 

If a,c — fr»b,d,e then a— ►b^c IMPLIES d) 
If a,c,d — and a,d — i*^e,f then 
a , (f IMPLIES d) , d — >-e . 

If c,a— ►b,d,e and d,a— ^-b,c,e then 
a_*^b, (c EQUIV d) , e , 

If c,d,a,e— #^f and a,e— ►f,c,d then 
a, (c EQUIV d),e~*-f. 



e. 



Table 2. Rules of Derivation 



29 



of cases of Rule 1, and using the ten derivation rules, introduce the 
proper connectives to arrive at the sequent which represents the original 
theorem. In contrast to the usual proof procedure, the Wang Algorithm 
begins with the original theorem and applies the rules backwards to ob- 
tain connective -free sequents. The rules are designed so that given a 
sequent, the leftmost connective may be eliminated, thereby resulting 
in one or two premises which, taken together, are equivalent to the con- 
clusion. This process is repeated until a finite set of sequents is 
reached which contain only propositional letters. The original sequent 
is a theorem if all of the sequents thus obtained are theorems when 
tested by the initial rule. 

The following example illustrates the proof procedure. The first 
line is the formula to be tested, with the sequent arrow prefixed to it. 
Successive lines are the sequents generated by each elimination of a 
connective. Each line is preceded by the number of the rule which pro- 
duced the sequent. 

>((NOT(P) AND NOT(Q) ) IMPLIES (P EQUIV Q) ) 

5A (NOT (P) AND NOT(Q))-*-(P EQUIV Q) 

3B NOT(P) ,NOT(Q)— 1KP EQUIV Q) 

2B NOT(Q)~*KP EQUIV Q),P 
2B — ►(P EQUIV Q),P,Q 
6A P-^Q,P,Q 
VALID 

6A Q-*»P,P,Q 
VALID 

The techniques used for implementing this algorithm are presented 
in the next section. 
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B. IMPLEMENTATION OF THE WANG ALGORITHM 



The implementation of the Wang Algorithm requires a structure for 
representing the sequents and a set of routines for manipulating the 
structure according to the rules stated in the previous section. The 
sequent can be conveniently represented as a tree. The root of the 
tree represents the sequent arrow. The root then has two successors, 
the antecedent and the consequent. Each formula is represented as a 
subtree of the antecedent if it is on the left side of the arrow, or 
the consequent if it is on the right side. A binary connective is 
represented as a node with two branches: one for each of the formu- 

las it connects. The unary connective, NOT, is a node with one suc- 
cessor, i.e., the formula it negates. The tree representation of the 
theorem shown in Figure 7 illustrates the structure. 




Figure 7. Representation of a Sequent 
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Certain changes in notation are made to facilitate the input of se- 
quents in this tree form. The notation described in Section II. C for 
the description of trees in linear form is used. Thus the expression 
(P IMPLIES Q) becomes (IMPLIES (P, Q) ) „ Using this notation, the state- 
ment of the theorem in the example is as follows: 

(SEQUENT (LEFT , RIGHT (IMPLIES (AND (NOT (P) , NOT (Q) ) , EQUIV (P , Q) ) ) ) ) . 

The theorem is input in this notation. The algorithm proceeds to 
eliminate the connectives one at a time. The routines eliminating the 
connectives are simply a set of tree manipulation routines which per- 
form transformations on the current sequent until all connectives are 
removed. Five of the rules generate additional premises which must be 
tested for validity. These new sequents are stacked as they are gen- 
erated. When the connectives are eliminated from the current sequent, 
it is tested and, if valid, the next sequent is obtained from the 
stack to become the current sequent. This process is repeated until 
all premises have been tested and the stack is empty. The program is 
then ready for a new theorem* 

A general flow chart for the program is shown in Figure 8. The 
photographs in Appendix C are taken from a sample run of this program. 
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Figure 8* Wang Algorithm Flow Chart 
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IV, CONCLUSION 



TREEPAK, described in Sections I and II, is designed to provide a 
user-oriented means for interacting with tree- type data structures. In 
Section III, an example was given of a program which uses tree struc- 
tures to represent strings of logic symbols in an on-line theorem tester. 
’This type of data structure occurs in many computer applications. TREE- 
PAK can be a valuable aid to the user for many of these applications 
which fulfill the interface requirements described in Appendix A. 

In Section II. D. 2, it was shown that any list may be displayed as 
a tree. This technique provides a useful picture of the structure if 
the number of cycles is small compared to the total number of nodes. 

If the structure contains many cycles it may still be displayed as a 
tree, but the user may find that it is hard to interpret the picture 
due to the many repeated nodes. Thus, the primary usefulness of TREE- 
PAK is in displaying and manipulating structures which are trees (con- 
tain no cycles) or "nearly" trees (contain few cycles). 



34 



APPENDIX A 



USER r S GUIDE 

The information contained in this appendix is not necessary for a 
general understanding of the system, nor is it intended to be a self- 
contained user’s guide. It is provided for the potential user and con- 
tains details on how to access and operate TREEPAK not contained in the 
previous pages. 

A. MAIN PROGRAM REQUIREMENTS 

Main program requirements include a declaration of the arguments 
used in the call, a declaration of TREEPAK itself, and the call to 
TREEPAK. The following is an example of the minimum main program 
which can successfully call TREEPAK: 

MAIN: PROCEDURE OPTIONS (MAIN); 

DCL P POINTER, MSG CHAR(74) VAR INIT( ,f ), 

TREEPAK RETURNS (POINTER) ; 

P = NULL; 

P = TREEPAK(P,MSG) ; 

RETURN ; 

END MAIN; 

This program contains all the necessary statements. Note that in 
this example a tree does not exist in the main program at the time of 
the call to TREEPAK. In this case, the first argument must have the 
value of the NULL pointer. No message is passed to TREEPAK for dis- 
play in this example, however, a character string of up to seventy- 
four characters may be passed as the second argument. When control 
returns to the main program a pointer to the root of the tree which 
was on display at the time of the return is placed in P. 
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If a tree is passed to TREEPAK, or if reference is made to the re- 
turned tree by the main program the structure of the nodes must be de- 
clared in the following format: 

DCL TOP FIXED BIN, 

1 NODE BASED (P), 

2 NAME CHAR(IO), 

2 DEGREE FIXED BIN, 

2 BRANCH (0 : TOP REFER (DEGREE)); 

When a tree is passed to TREEPAK for display a pointer to the same tree, 
which may have been altered by the user at the console, is returned to 
the main program. 

It is permissable for the main program to access the graphic display 
unit directly without going through TREEPAK. The only restriction is 
that the data control block and attention handler used for the direct 
access must be closed prior to calling TREEPAK, TREEPAK closes its 
graphics data control block and attention handler before returning to 
the main program. 



B. JOB CONTROL STATEMENTS 

Job control statements required are those normally required for 

running a PL/I job and statements required for loading a copy of 

TREEPAK and WJS999. The following is an example of those statements 

required at Naval Postgraduate School: 

// Standard job card,TYPRUN=HOLD 
// EXEC PL1LFCLG, REGION, G0= 15 OK 
//PL1L.SYSIN DD * 

User's main program 



/* 

//LKED.LIBA DD DSN=S0373 . PLIGPAK, UNIT=2314 , VOL=SER=LINDA, 
DISP=SHR 

//LKED.SYS IN DD * 

LIBRARY LIBA (TREEPAK, WJS999) 
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/* 

//GO. DISPLAY DD UNIT= (2250- 1) 

//GO. SYS IN DD * 

Trees in list notation if desired (two cards per tree) 

/* 

The job card is a standard job card except that TYPRUN=HOLD is 
required for all jobs which use the graphic display unit at NPS. The 
data definition statement named LIBA describes a partitioned data set 
which contains two members, TREEPAK and WJS999, both in load module 
form. The data definition statement for the graphic display unit 
must be named DISPLAY. 



C. USING THE PROGRAMMED FUNCTION KEYBOARD 



Only those keys which are active at any particular time are lighted. 
For example, if no node has been identified, the keys which require an 
identified node are not lighted, if no trees have been saved, the key 
which retrieves a saved tree is not lighted. The following is a de- 
scription of the function of each key. 



PFK Key Function 

0 Read and display a tree from SYSIN , Two cards are 

read from the GO. SYSIN data set. The list descrip- 
tion of a tree must appear on those cards. The list 
may begin any place on the cards, but there must not 
be any intervening blanks between the beginning and 
end of the list. Syntax errors cause an error message 
to the user and the "card images are then displayed on 
the screen. At this. time the user may correct the er- 
ror found and create the tree by pressing PFK key 1. 

If no more cards are available to be read, a message 
to the user is generated. 



1 Read and display a tree typed at console keyboard . 

This function causes the typing area of the screen 
to be read. The subsequent actions are the same as 
those for PFK 0 after reading the cards. The im- 
portant point to note is that the list must be typed 
in before pressing the key. 
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PFK Key 
2 



4 



5 



6 

7 

8 

10 



Function 



Save displayed tree in save area . The current tree is 
saved. If the tree is to be assigned a name, the rictme 
must be typed in before pressing the key. The name may 
be up to ten alphanumeric or special characters. If an 
identical name already appears in the save area, the old 
tree is replaced. The user is then informed of the 
number assigned to the tree by TREEPAK. 

Display identified node structure . Execution of this 
function does not change the structure of the tree, it 
merely changes the display screen so that the identified 
node appears as the root. Only that portion of the cur- 
rent tree which is the substructure of the identified 
node appears. 

Add structure to identified node . This key causes an 
asterisk to appear at all the possible positions in 
which a successor could be added to the identified node. 

A message instructs the user to type in the new successor, 
then touch the desired position with the light pen. Ei- 
ther a save number, a saved name or the list notation for 
a tree may be typed in. After the light pen is activated 
over one of the asterisks, the tree will be placed on the 
screen with the added structure in the position indicated. 
If, after pressing the key, the user does not wish to add 
a successor, the display may be reinitialized by using 
PFK key 18. 

Delete identified node . The identified node and all that 
it dominates is deleted from the current tree. Pressing 
this key when the identified node is the root of the cur- 
rent tree will result in an error message. (See PFK 10 
for deleting an entire tree from the save area). 

Detach identified node and save . This operation is the 
same as for PFK key 6, except that the deleted subtree 
is saved in the save area. If the subtree is to be as- 
signed a name, it must be typed in before pressing the 
key. 

Save id entifie d node . The subtree dominated by the iden- 
tified node is saved, however, the structure of the cur- 
rent tree is not changed. The name assigned, if applicable, 
must be typed in before pressing the key. 

Delete a tree f rom save a rea . The name or number of a 
tree in the save area must be typed in before pressing 
the key. That tree is then dropped from the save area, 
and will no longer be accessible by that name or number. 
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Function 



FFK Key 

11 Display a tree from save area . The name or number of a 

tree which has been saved must be typed in before press- 
ing the key* This tree then becomes the current tree 
and is displayed on the screen* If the tree which was 
current at the time the key was pressed has not been 
saved, it will no longer be accessible* 

16 Display names and numbers of trees in save area * A list- 
ing of the trees which have been saved is presented, in- 
cluding their names and numbers* When this display is 

on the screen, PFK 18 must be pressed before any further 
action is taken* All other functions are disabled during 
this time. 

17 Display instructions * The underlined portions of the 
instructions printed here are displayed* PFK 18 must be 
pressed before continuing* 

18 Terminate instruction display and/or reinitialize display . 
The display returns to the current tree if either PFK key 
16 or 17 has been pressed. This key may also be used to 
reinitialize the display if PFK 5 is inadvertently pressed 
and the user does not wish to add any successor to the 
identified node. 

31 Return to the main program * The display is terminated, 

the console interrupts disabled, and the data control 
block is closed before returning to the main program. 

A pointer is returned to the root of the tree which was 
passed as an argument when TREEPAK was called, or if the 
argument was the NULL pointer, a pointer to the root of 
the tree on display is returned* If the user wishes to 
return a character string to the main program, he may 
type it on the keyboard before returning. The character 
string will be placed in the character string which was 
used as an argument in the call. 
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/* PARAMETER TABLE FOR PASSING VALUES TO GCPRNT ROUTINE */ 
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OCL ( F I ND_NO» NUMBER i STO_ROOT f SAVE_NO ) RETURNS (FIXED BIN); 
/* VARIABLE USED TO SAVE POINTERS TO TREE ROOTS */ 
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/* TEMPORARY LINEAR LIST TO HOLD NODE INFORMATION FOR TREE */ 
DCL P POINTER; 

DCL 1 TEMP BASED IP). 
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Tree representation of a sequent. 




Result of removing "AND" from the preceding 
sequent o 




Node number 4 is the identified node. The PFK 
key for "add" has been pressedo The subtree 
to be added is typed at lower left. 




Result of touching the light pen to the left 
asterisk in the preceding tree. 



6? 






General Instruction display. 




A tree which was created at the console keyboard. 
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Node number 1? is the identified node. The 
"add" PFK key has been pressed. 




After adding the new subtree to node 1? in 
the preceding tree. 
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